Vocal Tract Warping for Normalizing Inter-Speaker Differences in Vocal Tract Transfer Functions
نویسندگان
چکیده
Vocal tract warping functions for normalizing vocal tract transfer functions of seven male subjects were calculated based on a vocal tract deformation method based on the vocal tract length sensitivity function. Vocal tract area functions for the five Japanese vowels of six subjects were tuned for their first four formant frequencies to be close to those of a target subject. The vocal tract warping functions were obtained as relationship between the original and deformed area functions. The results indicate that (1) the warping functions are not linear functions, (2) the vocal tract length of the deformed area functions are different from that of the target subject, and (3) the shape of the warping functions of the five vowels are not constant for each subject.
منابع مشابه
Speaker normalization based on frequency warping
In speech recognition, speaker-dependence of a speech recognition system comes from speaker-dependence of the speech feature, and the variation of vocal tract shape is the major source of inter-speaker variations of the speech feature, though there are some other sources which also contribute. In this paper, we address the approaches of speaker normalization which aim at normalizing speaker's v...
متن کاملتخمین سریع ضرایب پیچش در هنجارسازی طول مجرای صوتی با استفاده از امتیاز به دست آمده از مدلسازی تشخیص جنسیت
The performance of automatic speech recognition (ASR) systems is adversely affected by the variations in speakers, audio channels and environmental conditions. Making these systems robust to these variations is still a big challenge. One of the main sources of variations in the speakers is the differences between their Vocal Tract Length (VTL). Vocal Tract Length Normalization (VTLN) is an effe...
متن کاملSpeaker normalization based on test to reference speaker mapping
The paper presents the speaker normalization technique we implemented in a teaching and training system for hearing handicapped children with the goal to reduce inter-speaker variability in time-frequency speech representation. In an effort to reduce variance caused by variation in vocal tract shape among speakers, a formant based nonlinear frequency warping approach to vocal tract normalizatio...
متن کاملA frequency warping approach to speaker normalization
In an effort to reduce the degradation in speech recognition performance caused by variations in vocal tract shape among speakers, this thesis studies a set of lowcomplexity, maximum likelihood based speaker normalization procedures. By approximately modeling the vocal tract as a simple acoustic tube, these procedures compensate for the effects of the variations in vocal tract length by linearl...
متن کاملVocal Tract Length Normalization for Large Vocabulary Continuous Speech Recognition
Generally speaking, the speaker-dependence of a speech recognition system stems from speaker-dependent speech feature. The variation of vocal tract length and/or shape is one of the major source of inter-speaker variations. In this paper, we address several methods of vocal tract length normalization (VTLN) for large vocabulary continuous speech recognition: (1) explore the bilinear warping VTL...
متن کامل